Authoring case based training by document data extraction
Abstract
Background: Modeling is the bottleneck to successful implementation of knowledge management systems. In this paper, we propose an evolutionary approach to modeling based on word processing documents, and we describe Phoenix, the tool that provides the technical infrastructure.
Methods: We applied our approach and software system to the authoring of medical case-based training systems. So far, authors have had to either hand-code the content (usually as HTML) or use highly sophisticated authoring systems that require instruction and experience to master. Our approach carries further the ideas that Felciano and Dev put into practice in their system Short Rounds [4], which only presented pre-existing documents as an electronic patient record. Following our approach of evolutionary modeling, authors annotate documents to build full-fledged diagnostic training cases [5].
Results: For our training environment d3web.Train [6, 7], we developed a tool that extracts case knowledge from existing documents, usually dismissal records, by extending Phoenix to d3web.CaseImporter [8]. Independent authors used this tool to develop training systems in areas such as rheumatology, gastroenterology, and cytology. They observed a significant decrease in the time needed for settling in (from several months down to 1 hour) and in the time needed to develop a case (down to 4-6 hours) [9].
Conclusions: This paper describes the general approach and provides an in-depth analysis of the document parsing engine (Phoenix). To generalize the success of d3web.CaseImporter, we conclude by sketching further ... http://www.d3webtrain.de
Phoenix is available under the LGPL open source license from https://sourceforge.net/projects/phoenix-ie/.
Similar resources
A new text mining method for extracting user context information to improve the ranking of search engine results
Today, the importance of text processing and its uses is well known among researchers and students. The amount of textual and documentary material increases day by day, so we need useful ways to store it and retrieve information from it. For example, search engines such as Google, Yahoo, and Bing need to read a great many web documents and retrieve the ones most similar to the user ...
Case Authoring from Text and Historical Experiences
The problem of repair and maintenance of complex systems, such as aircraft, cars and trucks is certainly a nontrivial task. Maintenance technicians must use a great amount of knowledge and information resources to solve problems that may occur. This paper describes a semi-automated tool that sorts through the mass of information a maintenance technician must consult in order to make a repair, t...
Document Analysis And Classification Based On Passing Window
In this paper we present a document analysis and classification system to segment and classify the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction, and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing the image, and detecting and correcting skew. In document segmentation, an algorith...
Towards a new authoring environment: overview of some ontology based systems
This paper presents some requirements for a new ontology-based authoring environment. By analyzing some systems that use ontologies for several tasks, we identified some features and purposes and showed how they can contribute to help define a new authoring environment based on ontologies to represent information before a document is published. The systems analysed fulfil specific tasks such as...
Aspects of Collaborative Authoring in WBT Systems
The paper discusses possibilities of constructing new training objects automatically, on-the-fly as a result of any collaborative activity within a WBT system. Such collaborative activities may include participating in discussion forums, brainstorming sessions, writing document annotations, etc. This feature might be seen as collaborative authoring of training objects in WBT systems. Technical ...
Journal: CoRR
Volume: abs/cs/0509040
Publication date: 2005